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SEQUENCE LISTING 

GENERAL INFORMATION: 

(i) APPLICANT: Huston, James S. 
v ' Houston, L. L. 

Ring, David B. 

Oppermann, Hermann 

TITLE OF INVENTION: Eiosynthetic Binding Proteins For 
Imaging 

(iii) NUMBER OF SEQUENCES: II 

(iv ) CORRESPONDENCE ADDRESS: Xhibe ault/Patent Department 

( { A) ADDRESSEE: Jesta, Hurvxtz & I street 

(B) STREET: Exchange Place, 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02109 

(V) COMPUTER READABLE FORM: 
( (k) MEDIUM TYPE: Floppy 

B COMPUTER: IBM PC compatible 

C OPERATING SYSTEM: PC-DOS /MS-DOS ^ 
(D) SOFTWARE: Patentln Release ffi.u, 

(vi) CURRENT APPLICATION DATA: 
{ ' (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(ix) TELECOMMUNICATION IN™f^° N: 
( (k) TELEPHONE: 617-248-7477 

(B) TELEFAX: 617-248-7100 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



) 



) 
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(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 752 fl7 , lf , ft _ Fw , c-terminal 
(D) OTHER INFORMATION: /product= "741F8 sFv C termxna 

Gly4-Cys" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

"S22222222S522221S 
S 2 2 2 S 2 S 2 5S i S 2 S 2 2 2 

20 25 

ACA AAC TAT GGA ATG AAC TGG GIG AAG CAG GCT CCA GGA AAG GGT TTA 
Thr Asn Tyr Gly Met Asn Trp Val Lys Gin Ala Pro wy y 
35 40 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 

50 55 

2 2 2 2 2 S ^ 0 2 2 2 ,2 2 2 2 2 2 
2222222222222222 

80 85 9° 

« TTC TGT GGA f G CAA TIT ATT ACC TAG « « « « 2 2 
Tyr Phe Cys Gly Arg Gin Phe He Thr lyr wy m 
100 105 

2 Si g 2 2 2 2 5 2 2 22 2 2 2 2 

115 120 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 

130 135 

AAA TTC ATG TCC ACG TCA GTG GGA GAC AGG GTC AGC ATC TCC TGC AAG 
Lys Phe Met Ser Thr Ser Val Gly Asp Arg Val Ser lie y 



145 150 



> 
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GCC AGT CAG GAT GTG AGT ACT GCT GTA GCC TGG TAT CAA CAA AAA CCA 
Ala Ser Gin Asp Val Ser Thr Ala Val Ala Trp iyr ox ^ 
160 165 

GGG CAA TCT CCT AAA CTA CTG ATT TAC TGG ACA TCC ACC CGG CAC ACT 
Gly Gin Ser Pro Lys Leu Leu lie Tyr Trp Thr Ser inr b 
180 185 

GGA GTC CCT GAT CGG TTC ACA GGG AGT GGA TCT GGG ACA GAT TAT ACT 
Gly Val Pro Asp Arg Ehc Thr Gly Ser Gly Ser Gly inr 

« S ffi ^ S S giu 25 S fit g s ^ s: 

210 215 1 

225 230 
GAG ATA AAA CGG GCT GAT GGG GGA GGT GGA TGT TAACGGGGGA GGTGGATGTT 
Glu He Lys Arg Ala Asp Gly Gly Gly Gly Cys 
240 245 250 

GGGTCTCGTT ACGTTGCGGA TCTCGAGGCT ATCTTTACTA ACTCTTACCG TAAAGTTCTG 
GCTCAACTGT CTGCACGCAA GCTTTTGCAG GATATCATGA GCGCTTAAGA TCCGTCGACC 
TGCAGGCATG CAAGCTT 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Ala Glu lie Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys_ Pro 
1 5 10 

Gly Glu Thr Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr 

20 25 
Asn Tyr Gly Met Asn Trp Val Lys Gin Ala Pro Gly Lys Gly Leu Lys 
35 A0 



527 

575 

623 

671 

719 

772 

832 
892 
909 



) 
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Trp Met Gly Trp He Asn Thr Asn Thr Gly Glu Fro Thr Tyr Ala Glu 

50 55 
Glu Phe Lys Gly Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala Ser Thr 

65 70 75 

Ala Tyr Leu Gin He Asa Asa Leu Lys Asn Glu Asp Thr Ala Thr Tyr 
85 90 

Phe Cys Gly Arg Gin Phe He Thr Tyr Gly Gly Phe Ala Asn Trp Gly 

100 lo:> 
Gin Gly Thr Leu Val Thr Val Ser Ala Ser Ser Ser Ser Gly Ser Ser 



115 



120 



Ser ser Gly Ser Ser Ser Ser Asp lie Val Her Thr Gin Ser Pro Lys 

130 135 
Phe Met Ser Thr Ser Val Gly Asp Arg Val Ser He Ser Cys Lys Ala 



145 150 



Ser Gin Asp Val Ser Thr Ala Val Ala Trp Tyr Gin Gin Lys Pro Gly 



165 



Gin Ser Pro Lys Leu Leu lie Tyr Trp Thr Ser Thr Arg His Thr Gly 
180 

Val Pro Asp Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu 

Thr lie Ser Ser Val Gin Ala Glu Asp Leu Ala Leu His Tyr Cys Gin 

210 215 220 

Gin His Tyr Arg Val Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu 
225 230 23 

He Lys Arg Ala Asp Gly Gly Gly Gly Cys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 779 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.-758 . h 
D OTHER INFORMATION: /product= "26-10 sFv vitn 

C- terminal Gly4-Cys" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

« £ Si S ^ 2 2 S 2 2 2 2 2 2 £ 5 
2 S 2 S 2 2 2 Si 2 2 2 2 S K 2 

20 25 

GAC TTC TAC ATG AAT TGG GTT CGC CAG TCT CAT GGT AAG TCT « GAG 
Asp Phe Tyr Met Asn Trp Val Arg Gin Ser his uxy y 
35 A0 

2 £ 5 2 2 2 2 2 2 2 2 2 2 2 2 2 

50 55 

2222 2 2 2 2 2 2 2 £ 2 2 2 2 

65 70 75 

GCT TAC ATG GAG CTG CGT TCT TTG ACC TCT GAG GAC TCC GCG GTA TAC 
Ala Tyr Met Glu Leu Arg Ser Leu Thr Ser Glu Asp ber a ^ 
80 85 

222 g 2 2 2 2 2 Sj S - 5 " 3 * 
2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 
2 2 2 2 2 2 £ 2 2 2 2 2 g 2 2 2 

130 135 

CTG CCG GTT TCT CTG GGT GAC CAG GCT TCT ATT TCT TGC CGC TCT TCC 

Leu Pro Val Ser Leu Gly Asp Gin Ala Ser He Ser Cys Arg 
145 150 155 

r*r TPT CTG GTC CAT TCT AAT GGT AAC ACT TAC CTG AAC TGG TAC CTG 
3S 2u Val £s Ser Asn Gly Asn Thr Tyr Leu Asn Trp Tyr Leu 



47 

95 
143 

191 

239 

287 

335 

383 
431 
479 
527 



- 63 - 



CAA AAG GCT GGT CAG TCT CCG AAG CTT CTG ATC TAG AAA GTC TCT AAC 
Gin Lys Ala Gly Gin Ser Pro Lys Leu Leu lie Tyr Lys 
180 185 

S 2 2 I 2 S 2 S 2 2 2 2 S 1 5 - 
Li 2 2 2 2 2 2 S 2 2 2 .2 § 2 2 2 

210 215 

£ 2 2 2 2 2 2 2 2 2 2 2 2 2 S 2 
££22222 '222225*^ 



240 245 
CCGGGGGATC C 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly 
1 5 10 

Ala Ser Val Arg Met Ser Cys Lys Ser Ser Gly Tyr lie Phe Thr Asp 

20 25 
Phe Tyr Met Asn Trp Val Arg Gin Ser His Gly. Lys Ser Leu Asp Tyr 

35 40 
lie Gly Tyr He Ser Pro Tyr Ser Gly Val Thr Gly Tyr Asn Gin Lys 

50 55 60 

Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Lys Ser Ser Ser Thr Ala 

65 70 ° 

Tyr Met Glu Leu Arg Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr 
85 90 



575 

623 

671 

719 

768 

779 



) 
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Cys Ala Gly Ser Ser Gly A» Lys Tro Ala net Asp Tyr Trp Gly His 

100 105 
fly Ala Ser Val Thr Val Ser Ser Ser Gly Ser Ser Ser Ser Gly Ser 



115 120 



Ser Ser Ser Gly Ser Asp Val Val Het Thr Gin Thr Pro Lee Ser Leu 

130 135 
Pro Val Ser Leu Gly Asp Gin Ala Ser He Ser Cys Arg Ser Ser Gin 
145 150 

Ser Leu Val His Ser Asn Gly Asn Thr Tyr Leu Asn Trp Tyr Leu Gin 
165 170 



Lys Ala Gly Gin Ser Pre Lys Leu Leu lie Tyr Lys Val Ser Asn Arg 

180 18:> 
Phe Ser Gly Val Pre Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 

Phe Thr Leu Lys He Ser Arg Val Glu Ala GLu a.j Leu Gly He Tyr 

210 215 
Phe Cys Ser Gin Thr Thr His Val Pro Pro Thr Phe Gly Gly Gly Thr 
225 230 

Lys Leu Glu lie Lys Arg Ser Gly Gly Gly Gly Cys 



245 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(ii) H0LECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..729 „c 9nr<) sFv polypeptide 
(D) OTHER INFORMATION: /products 520C9 stv poxyp p 

sequence" 



65 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ss s £ £ a s £ s £ a a s £ s 5 s 

£ £ £ E S SS S 2 S £ - E ££ - 

20 25 

GGA ATG AAC TGG ATG AAG CAG GCT CCA GGA AAG GGT TTA AAG TGG ATG 
Gly Het Asn Trp Met Lys Gin Ala Pro Gly Lys Gly Leu y r 
35 

^ £ S E Thr Tyr E £ S S E ? 2 E E 2 

50 55 
AAG GAA CGG TTT GCC TIC TCT TTG GAA ACC TCT GCC ACC ACT GCC CAT 
Lys Glu Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala inr inr gQ 

65 70 7b 

TTG CAG ATC AAC AAC CTC AGA AAT GAG GAC TCG GCC ACA TAT TTC TGT 
Leu Gin lie Asn Asn Leu Arg Asn Glu Asp Ser Ala Thr Tyr rne y 
85 vu 

St £ £ S cfy e s s 5 s St S E E 5 E 

S E S E a 5 SS E S £ £ E I - 5 2 

115 120 1/3 

EEEEEEESEEEgSMS 

130 135 140 

S EE SS £ Git trg SS £ E £ 5 3 S £ £ 

!45 150 15:5 

£ £ £ ttn £ £ £ S £ £ SS £ £ 5 1 £ 

S % £ £ £ £ £ £ £ £ £ 5 £ § S £ 

ion 185 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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£ 52 £ S £ 5 1 S S S - I = - - 
£ £ 2 5 2 2 2 2 S - | 5 = - " 

210 215 220 

S 2 S S 2 2 2 S 2 5 | ^ 2 2 2 £ 

230 23b 



225 



CGG GCT GAT TAATCTGCAG 
Arg Ala Asp 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
Glu He Gin Leu Val Gin Ser Gly Pro Glu Leu Lys Lys Pro Gly Glu 



1 5 



Thr Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Ala Asn Tyr 

20 25 
Gly Met Asn Trp Met Lys Gin Ala Pro Gly Lys Gly Leu Lys Tr P Met 



Gly Trp lie Asn Thr Tyr Thr Gly Gin Ser Thr Tyr Ala Asp Asp Phe 

50 55 
Lys Glu Arg Phe Ala Phe Ser Leu Glu Thr Ser Ala Thr Thr Ala His 
65 70 

Leu Gin lie Asn Asn Leu Arg Asn Glu Asp Ser Ala Thr Tyr Phe Cys 

85 90 
Ala Arg Arg Phe Gly Phe Ala Tyr Trp Gly Gin Gly Thr Leu Val Ser 



100 L05 
Ser 

115 120 



Val Ser Ala Ser He Ser Ser Ser Ser Gly Ser Ser Ser Ser Gly Ser 



624 



672 



720 



739 



) 
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Se, Ser Ser Gly Ser Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu 

130 135 
Ser Ala Ser Leu Gly Glu Arg Val Ser Leu Thr Cys Arg Ala Ser Gin 
145 150 155 

Asp lie Gly Asn Ser Leu Thr Trp Leu Gin Gin Glu Pro Asp Gly Thr 

165 170 
He Lys Arg Leu lie Tyr Ala Thr Ser Ser Leu Asp Ser Gly V,l Pro 

180 185 
Lys Arg Phe Ser Gly Ser Arg Ser Gly Ser Asp Tyr Ser Leu Thr lie 

Ser Ser Leu Glu Ser Glu Asp Phe Val Val Tyr Tyr Cys Leu Gin Tyr 

210 215 
Ala lie Phe Pro Tyr Thr Phe Gly Gly Gly Thr Asn Leu Glu lie Lys 
225 



230 235 



Arg Ala Asp 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note= "Limcer 1 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 



) 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..15 httmctr ?« 
(D) OTHER INFORMATION: /note= LINKER 2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ser Ser Ser Ser Gly Ser Ser Ser Ser Gly Ser Ser Ser Ser Gly 
1 5 1U 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

$ S'iSfORMATION: /note- «C-Terminal Tail (Ser-Cys)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ser Cys 
1 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:, linear 



) 
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(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 



(B) LOCATION: 1..5 . 
(D) OTHER INFORMATION: /note= "C-Termmal Tail 
(Gly-Gly-Gly-Gly-Cys ) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Gly Gly Gly Cys 
1 5 

INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..11 . , m . _ 
(D) OTHER INFORMATION: /note= "C-Terrainal Taxi 

(His-His-His-His-His-His-Gly-Gly-Gly-Gly-tys) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

His His His His His His Gly Gly Gly Gly Cys 
1 5 10 



ti 



